35 research outputs found

    ArrayExpress—a public database of microarray experiments and gene expression profiles

    Get PDF
    ArrayExpress is a public database for high throughput functional genomics data. ArrayExpress consists of two parts—the ArrayExpress Repository, which is a MIAME supportive public archive of microarray data, and the ArrayExpress Data Warehouse, which is a database of gene expression profiles selected from the repository and consistently re-annotated. Archived experiments can be queried by experiment attributes, such as keywords, species, array platform, authors, journals or accession numbers. Gene expression profiles can be queried by gene names and properties, such as Gene Ontology terms and gene expression profiles can be visualized. ArrayExpress is a rapidly growing database, currently it contains data from >50 000 hybridizations and >1 500 000 individual expression profiles. ArrayExpress supports community standards, including MIAME, MAGE-ML and more recently the proposal for a spreadsheet based data exchange format: MAGE-TAB. Availability:

    A System for Information Management in BioMedical Studies—SIMBioMS

    Get PDF
    Summary: SIMBioMS is a web-based open source software system for managing data and information in biomedical studies. It provides a solution for the collection, storage, management and retrieval of information about research subjects and biomedical samples, as well as experimental data obtained using a range of high-throughput technologies, including gene expression, genotyping, proteomics and metabonomics. The system can easily be customized and has proven to be successful in several large-scale multi-site collaborative projects. It is compatible with emerging functional genomics data standards and provides data import and export in accepted standard formats. Protocols for transferring data to durable archives at the European Bioinformatics Institute have been implemented

    ArrayExpress—a public repository for microarray gene expression data at the EBI

    Get PDF
    ArrayExpress is a public repository for microarray data that supports the MIAME (Minimum Informa-tion About a Microarray Experiment) requirements and stores well-annotated raw and normalized data. As of November 2004, ArrayExpress contains data from ∼12 000 hybridizations covering 35 species. Data can be submitted online or directly from local databases or LIMS in a standard format, and password-protected access to prepublication data is provided for reviewers and authors. The data can be retrieved by accession number or queried by vari-ous parameters such as species, author and array platform. A facility to query experiments by gene and sample properties is provided for a growing subset of curated data that is loaded in to the ArrayExpress data warehouse. Data can be visualized and analysed using Expression Profiler, the integrated data analysis tool. ArrayExpress is available at http://www.ebi.ac.uk/arrayexpress

    ArrayExpress update—an archive of microarray and high-throughput sequencing-based functional genomics experiments

    Get PDF
    The ArrayExpress Archive (http://www.ebi.ac.uk/arrayexpress) is one of the three international public repositories of functional genomics data supporting publications. It includes data generated by sequencing or array-based technologies. Data are submitted by users and imported directly from the NCBI Gene Expression Omnibus. The ArrayExpress Archive is closely integrated with the Gene Expression Atlas and the sequence databases at the European Bioinformatics Institute. Advanced queries provided via ontology enabled interfaces include queries based on technology and sample attributes such as disease, cell types and anatomy

    A call for public archives for biological image data

    Get PDF
    Public data archives are the backbone of modern biological and biomedical research. While archives for biological molecules and structures are well-established, resources for imaging data do not yet cover the full range of spatial and temporal scales or application domains used by the scientific community. In the last few years, the technical barriers to building such resources have been solved and the first examples of scientific outputs from public image data resources, often through linkage to existing molecular resources, have been published. Using the successes of existing biomolecular resources as a guide, we present the rationale and principles for the construction of image data archives and databases that will be the foundation of the next revolution in biological and biomedical informatics and discovery.Comment: 13 pages, 1 figur

    A Cross-Species Analysis of a Mouse Model of Breast Cancer-Specific Osteolysis and Human Bone Metastases Using Gene Expression Profiling

    Get PDF
    <p>Abstract</p> <p>Background</p> <p>Breast cancer is the second leading cause of cancer-related death in women in the United States. During the advanced stages of disease, many breast cancer patients suffer from bone metastasis. These metastases are predominantly osteolytic and develop when tumor cells interact with bone. <it>In vivo </it>models that mimic the breast cancer-specific osteolytic bone microenvironment are limited. Previously, we developed a mouse model of tumor-bone interaction in which three mouse breast cancer cell lines were implanted onto the calvaria. Analysis of tumors from this model revealed that they exhibited strong bone resorption, induction of osteoclasts and intracranial penetration at the tumor bone (TB)-interface.</p> <p>Methods</p> <p>In this study, we identified and used a TB microenvironment-specific gene expression signature from this model to extend our understanding of the metastatic bone microenvironment in human disease and to predict potential therapeutic targets.</p> <p>Results</p> <p>We identified a TB signature consisting of 934 genes that were commonly (among our 3 cell lines) and specifically (as compared to tumor-alone area within the bone microenvironment) up- and down-regulated >2-fold at the TB interface in our mouse osteolytic model. By comparing the TB signature with gene expression profiles from human breast metastases and an <it>in vitro </it>osteoclast model, we demonstrate that our model mimics both the human breast cancer bone microenvironment and osteoclastogenesis. Furthermore, we observed enrichment in various signaling pathways specific to the TB interface; that is, TGF-β and myeloid self-renewal pathways were activated and the Wnt pathway was inactivated. Lastly, we used the TB-signature to predict cyclopenthiazide as a potential inhibitor of the TB interface.</p> <p>Conclusion</p> <p>Our mouse breast cancer model morphologically and genetically resembles the osteoclastic bone microenvironment observed in human disease. Characterization of the gene expression signature specific to the TB interface in our model revealed signaling mechanisms operative in human breast cancer metastases and predicted a therapeutic inhibitor of cancer-mediated osteolysis.</p

    The discovAIR project: a roadmap towards the Human Lung Cell Atlas

    Get PDF
    The Human Cell Atlas (HCA) consortium aims to establish an atlas of all organs in the healthy human body at single-cell resolution to increase our understanding of basic biological processes that govern development, physiology and anatomy, and to accelerate diagnosis and treatment of disease. The Lung Biological Network of the HCA aims to generate the Human Lung Cell Atlas as a reference for the cellular repertoire, molecular cell states and phenotypes, and cell–cell interactions that characterise normal lung homeostasis in healthy lung tissue. Such a reference atlas of the healthy human lung will facilitate mapping the changes in the cellular landscape in disease. The discovAIR project is one of six pilot actions for the HCA funded by the European Commission in the context of the H2020 framework programme. discovAIR aims to establish the first draft of an integrated Human Lung Cell Atlas, combining single-cell transcriptional and epigenetic profiling with spatially resolving techniques on matched tissue samples, as well as including a number of chronic and infectious diseases of the lung. The integrated Human Lung Cell Atlas will be available as a resource for the wider respiratory community, including basic and translational scientists, clinical medicine, and the private sector, as well as for patients with lung disease and the interested lay public. We anticipate that the Human Lung Cell Atlas will be the founding stone for a more detailed understanding of the pathogenesis of lung diseases, guiding the design of novel diagnostics and preventive or curative interventions

    The EU-ToxRisk method documentation, data processing and chemical testing pipeline for the regulatory use of new approach methods

    Get PDF
    Hazard assessment, based on new approach methods (NAM), requires the use of batteries of assays, where individual tests may be contributed by different laboratories. A unified strategy for such collaborative testing is presented. It details all procedures required to allow test information to be usable for integrated hazard assessment, strategic project decisions and/or for regulatory purposes. The EU-ToxRisk project developed a strategy to provide regulatorily valid data, and exemplified this using a panel of > 20 assays (with > 50 individual endpoints), each exposed to 19 well-known test compounds (e.g. rotenone, colchicine, mercury, paracetamol, rifampicine, paraquat, taxol). Examples of strategy implementation are provided for all aspects required to ensure data validity: (i) documentation of test methods in a publicly accessible database; (ii) deposition of standard operating procedures (SOP) at the European Union DB-ALM repository; (iii) test readiness scoring accoding to defined criteria; (iv) disclosure of the pipeline for data processing; (v) link of uncertainty measures and metadata to the data; (vi) definition of test chemicals, their handling and their behavior in test media; (vii) specification of the test purpose and overall evaluation plans. Moreover, data generation was exemplified by providing results from 25 reporter assays. A complete evaluation of the entire test battery will be described elsewhere. A major learning from the retrospective analysis of this large testing project was the need for thorough definitions of the above strategy aspects, ideally in form of a study pre-registration, to allow adequate interpretation of the data and to ensure overall scientific/toxicological validity.Toxicolog

    Structural Similarity Mining in Semi-structured Microarray Data for Efficient Storage Construction

    No full text

    BioStudies database - filling in the gaps between publications and structured datasets

    No full text
    BioStudies is a new database at the European Bioinformatics Institute (EMBL-EBI) that aims to fill the gap between publications on one hand, and data sets in the ?traditional?, structured data archives available to life scientists on the other hand. It is able to accept and make available data from new and emergent technologies where data is produced in formats not (yet) supported by the current EMBL-EBI data resources. BioStudies is also able to link to data in other databases, as well as accept what is usually treated as supplementary data, creating a data ?package? that can be associated with a publication.<br>The basic BioStudies model is very simple and allows to describe files and external links associated with a study, organize information in hierarchies, and attach annotation as appropriate. A template mechanism can be used to tighten the requirements on study description, as agreed for a particular domain/community/project. Extra functionality can be added for specific purposes, such as the compound view in the Data Infrastructure for Chemical Safety (diXa) project. The user interface enables ontology-driven query expansion.<br>Moving forward, we have started discussions with publishers on streamlining the treatment of publication-to-data links, and on providing workflows to paper authors making it easier to create datasets as a part of the manuscript preparation process, as well as to cite existing datasets. We hope that BioStudies will provide a significant contribution to effective data management, meeting the requirements of life sciences research in the future.<br><br
    corecore